Pitch and Formants Estimation of Enhanced Noisy Compressed Speech Signal Corrupted By Real World Noise Using Recursive Filter
نویسندگان
چکیده
Speech compression, enhancement and recognition in noisy, reverberant conditions is a challenging task. In this paper a new approach to this problem, which is developed in the framework of probabilistic random modeling. speech coding techniques are commonly used in low bit rate analysis and synthesis . Coding algorithms seek to minimize the bit rate in the digital representation of a signal without an objectionable loss of signal quality in the process. As the compression techniques that are used are Lossy compression technique and there is every possibility of loss in quality. Speech enhancement aims to improve speech quality by using various algorithms. This paper deals with multistage vector quantization technique used for coding (compression) of narrow band speech signal. The parameter used for coding of speech signals are the line spectral frequencies, so as to ensure filter stability after quantization. The code books used for quantization are generated by using Linde, Buzo and Gray(LBG) algorithm. The existing Speech enhancement techniques like spectral subtraction and Kalman filters performances are compared with the proposed recursive filter and approach yields significantly estimating the parameters like signal to noise ratio subjected to white Gaussian Noise and Real time noise signals. KeywordsLinear predictive Coding, Multi stage vector quantization, Line Spectral Frequencies (LSF).
منابع مشابه
New Time-frequency Domain Pitch Estimation Methods for Speech Signals under Low Levels of Snr
New Time-Frequency Domain Pitch Estimation Methods for Speech Signals under Low Levels of SNR Celia Shahnaz, Ph.D. Concordia University, 2009 Pitch estimation of speech signals is the key to understanding most acoustical phenomena as well as accurately designing many practical systems in speech communication. It is to determine the fundamental frequency or period of a vocal cord vibration causi...
متن کاملKalman tracking of linear predictor and harmonic noise models for noisy speech enhancement
This paper presents a speech enhancement method based on the tracking and denoising of the formants of a linear prediction (LP) model of the spectral envelope of speech and the parameters of a harmonic noise model (HNM) of its excitation. The main advantages of tracking and denoising the prominent energy contours of speech are the efficient use of the spectral and temporal structures of success...
متن کاملروشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه
Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...
متن کاملBulletin of the Polish Academy of Sciences
A stable and accurate estimation of the fundamental frequency (pitch, F0) is an important requirement in speech and music signal analysis, in tasks like automatic speech recognition and extraction of target signal in noisy environment. In this paper, we propose a pitch-related spectrogram normalization scheme to improve the speaker – independency of standard speech features. A very accurate est...
متن کاملSpeech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...
متن کامل